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ABSTRACT 

Aims. A recent analysis of polarization measurements of HINODE SOT/SP in the quiet Sun pointed out very complex 
shapes of Stokes V profiles. Here we present the first classification of the SOT/SP circular polarization measurements 
with the aim of highlighting exhaustively the whole variety of Stokes V shapes emerging from the quiet Sun. 
Methods, k-means is used to classify HINODE SOT/SP Stokes V profiles observed in the quiet Sun network and inter- 
network (IN). We analyze a 302 x 162 arcsec 2 field-of-view (FOV) that can be considered a complete sample of quiet 
Sun measurements performed at the disk center with 0.32 arcsec angular resolution and 10 -3 polarimetric sensitivity. 
This classification allows us to divide the whole dataset into classes, with each class represented by a cluster profile, 
i.e., the average of the profiles in the class. 

Results. The set of 35 cluster profiles derived from the analysis completely characterizes the SOT/SP quiet Sun mea- 
surements. The separation between network and IN profile shapes is evident - classes in the network are not present 
in the IN, and vice versa. Asymmetric profiles are approximately 93 % of the total number of profiles. Among these, 
about 34 % of the profiles are strongly asymmetric, and they can be divided into three families: blue-lobe, red-lobe, and 
Q-like profiles. The blue- lobe profiles tend to be associated with upflows (granules), whereas the red- lobe and Q-likc 
ones appear in downflows (intergranular lanes). 

Conclusions. These profiles need to be interpreted considering model atmospheres different from a uniformly magne- 
tized Milne- Eddington (ME) atmosphere, i.e., characterized by gradients and/or discontinuities in the magnetic field 
and velocity along the line-of-sight (LOS). We propose the use of cluster profiles as a standard archive to test inversion 
codes, and to check the validity and/or completeness of synthetic profiles produced by MHD simulations. 

Key words, sun: surface magnetism - sun: magnetic topology - techniques: polarimetric - methods: statistical 



1. Introduction 

The Stokes parameters (I, Q, U , and V) provide the main 
tool to analyze the solar photosphere. They are indeed 
molded by the photospheric structure and for this reason 
can tell us about its thermal, magnetic and dynamical prop- 
erties. 

When considering Stokes profiles measured in spectral 
lines that are sensible to magnetic fields via Zeeman effect, 
one could expect the circular polarization profiles (V) to 
be exactly antisymmetric with respect to a central wave- 
length of the transition, possibly shifted by plasma dy- 
namicfl Contrary to such an idealization, the Stokes V 
profiles emerging from the solar photosphere present im- 
portant asymmetries independently of whether the mea- 
surements are obtained with low spatial reso lution (e.g., in 
plage or network regions; for a revi ew see ISolankil fl993) 



or with high spatial resolution (e.g.. ISigwarth et al.lll 999: 



Sanchez Almeida fc Litesl 120001 : iDommguez Cerdeha et al.1 
20061 : Viticchie et alj|201ll ). and both in network and in 
the interior of the network (IN) . 
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lLandi Deel'Innocenti fc Land olfi 2004). 



measurements (Q, and 
be symmetric (see, e.g., 



Stokes V profile asymmetries are well known to be 
generated by gradients along and across the LOS of 
both magnetic field and plasma dynamics. Moreover, to 
deal with strong asymmetries one cannot consider mild 
variations of the above cited quantities, instead fairly 
large changes like discontinuiti es have to be invoked (e.g., 
iGrossmann-Doerth et al.l [2000) . Discontinuities in model 
atmospheres can be produced in many different ways. They 
are found in c a nopy models (e.g. IGrossmann-Doerth et al.l 
2000) or in embedded flux tube mocl- 
Solanki fc Montavonl 119931) . in which one or 
two magnetopauses are present, i.e., sharp transitions 
from a magnetized region to a field-free one. Besides 
these models, one can consider the presence of many 
discontinuities along the LOS in solar magnetic fields 
as being intermittent over scales smaller than the pho- 
ton mean free path at the photosphere (~ 100 km); 
these are t he Micro-Structured Magnetized Atmospheres 
(MIS MAs, ISanchez Almeida et all 119961: ISanchez Almeida! 
1998). Smooth variations of the atmospheric properties can 
also be invoked, but large asymmetries r equire steep gradi- 
ents i n these continuous variations (e.g., ISolanki fc Pahlkd 
119881 : ISanchez Almeida et al.l 119881 ) . The asymmetries are 
also partly caused by discontinuities across the LOS, in 
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other words, an important factor which can introduce 
asymmetries is the finite spatial resolution of observa- 
tions. The way it works remains unclear because rather 
than decreasing the asymmetries with improving angu- 
lar resolution, the asymmetries observed in the quiet Sun 
with 0.32 arcsec seem to be more extreme than those ob- 
tained with 1 arcsec (c.f. ISanchez Almeida fc Lited l2000t 
I Viticchie et al.|[20lTl ). On the one hand, one could expect 
to observe asymmetries mainly caused by gradients along 
the LOS when increasing the spatial resolution of the ob- 
servations. On the other hand, we still do not know over 
which scale t he magnetic field changes i n the solar photo- 
sphere, e.g., in lVoeler fe Schiisslerl (|2007f ) the magnetic field 
changes over approximately 10 km, well below the spatial 
resolution of HINODE. 

The arguments considered above must be taken 
into account when interpreting Sto kes measure- 
ment s through inversion codes (e.g., ISocas-Navarrol 
l200lh . Among these, the ones based on ME hy- 
potheses cannot deal with asymmetries, but are still 
very important since they allow u s to perform fast 
analyses 



o f large datasets (e.g., 
Orozco Suarez fe Del Toro Iniestal 



Sku manich fe Litea 
2007L 



1987t 

Orozco Suarez et al.1 2007a ). On the other hand, sev- 
eral inversion codes that can fit and interpret asymmetric 
profiles are available. The MISMA inversion code ( based 
on MISMA hypotheses, ISanchez Almeida! Il997l ) has 
been successfully used to interpret strong asymme- 
tries in polarization profiles emerging from the quiet Sun 
(ISanchez Almeida fc L ites 2000; iDommguez Cerdena et ah! 
I2006t IViticchie et all |2011| ). Other suitable tools to 
interpret Stokes V asymmetries are the SIR code 
(|Ruiz Cobo fc del Toro Iniestal 11992ft . or the SIR JUMP 
code (a modified version of the SIRGAUSS code, see 
iBellot Rubiol l2003ft . which was already used f or the inter- 
pretation of HINODE data (jLouis et al.ll2009h . 

Since its launch in 2006, the spec t ropol arimeter 
SOT/SP (jLites et all 120011: ITsuneta et all l2008h of the 
J apanese mission HINODE Ikosuei et al.1 120071) allows 
the solar community to perform spectropolarimctry of 
the solar photosphere under extremely stable conditions. 
SOT/SP provides 0.32 arcsec angular resolution mea- 
surements with 10 -3 polarimetric sensitivity. Moreover, 
its wavelength sampling (2.15 pm pixel" 1 ) is suitable 
to have a detailed sampling of the polarization profiles. 
Many inversion analyses of spectropolarimetry measure- 
ments performed by SOT/SP have been performed so far. 
In most cases the hypothesis of ME atmosphere has been 
adop t ed (e.g. 



. . Orozco Suarez et al.l l2007bl : lAsensio Ramos 

I2009t llshikawa fc Tsunetall2009D . Recently. Tviticchie et al. 
( 20111 ) provided the first interpretation of Stokes pro- 
file asymmetries re v ealed in th e dataset analyzed b y 
Orozco Suarez et all (|2007bl) and lAsensio Ramos! (|2009f> ; 
the analysis was performed under MISMA hypotheses. 

The aim of this paper is to further point out and quan- 
tify the complexity of the shapes of SOT/SP measurements 
already discussed in IViticchie et al.l (|201ll ). but using a 
tool independent of any inversion. To do this, we adopt a 
method that allows us to extract and characterize the typi- 
cal shapes of the Stokes V profiles observed by SOT/SP in 
the quiet Sun. As will be emphasized in § |4j a large part of 
these turned out to be strongly asymmetric. The main im- 
plication of this result is that refined inversion techniques 



should be considered in future analyses of SOT/SP data 
to extract further information on the structure of the solar 
photosphere. Those techniques are too burdensome to be 
viable for large datasets, but both the profile classes and 
families we present provide a small, yet complete set that 
is suitable for detailed study. 

The paper is organized as follows: we introduce the 
dataset selected for the analysis in § [5] we describe the 
classification tool adopted to extract the typical shapes of 
the Stokes V profiles from the dataset in § [3] the typical 
profile shapes are presented in § 0] we consider possible 
physical scenarios that could explain these profiles in § [5J 
and reflect on further applications of the classes; finally, our 
conclusions are outlined in § [5] 

2. Dataset 

We analyzed Stokes / and V profiles of a 302 x 162 arcsec 2 
portion of the solar photosphere observed at disk cen- 
ter on 2007 March 10 between 11:37 and 14:34 UT. 
The spectropolarimetric measurements were taken by the 
SOT/SP instr ument aboard HINODE (|Lites et al.l feOQlt 
ITsuneta et al"1l2008l) in the two Fe I 630 nm lines, with a 
wavelength sampling of 2.15 pm pixel -1 , and a spatial sam- 
pling of 0.1476 arcsec pixel -1 and 0.1585 arcsec pixel -1 
along the east-west and south-north directions, respec- 
tively. The data reduction and calibration were performed 
using the sp_prep.p ro routine available in the SolarSoft 
(jlchimoto et al.ll2008l ) . After correction for the gravitational 
redshift, we verified that the cores of the full-FOV average 
Stokes I profiles coincide within the SOT/SP wavelength 
sampl ing with the cores of the two Fe I lines in the Neckel 
atlas (Neckel 1999|. Moreover, the average line cores vary 
across the scanning direction with a standard deviation 
of appro ximately 60 m s -1 , d iscarding the problem men- 
tioned bv lSocas-Navarrol ()201lD . This accuracy of the abso- 
lute wavelength scale suffices because we mostly use relative 
velocities, and the absolute scale is employed only to dis- 
tinguish granules and intergranules (i.e., upflows and down- 
flows). Using the polarization signals in continuum wave- 
lengths, we estimated a noise level of cry ~ 1.1 x 10 -3 I c 
for Stokes V (I c stands for the average continuum in- 
tensity o f the FOV). The da t aset ha s been alrea d y ana - 
lyzed bv lOrozco Suarez et al l (l2007al). iLites et all |2008l) . 
lAsensio Ramosl (|2009f ). and IViticchie et al.l (|2011l ) to de- 
rive the magnetic properties of IN and network regions. 
Consistently with these works, here we focus on Stokes V 
profiles whose maximum amplitude is larger than 4.5 x ay. 
A total of 535465 Stokes V profiles meet the selection crite- 
ria (i.e., 26 % of the total number of profiles in the dataset). 
Such a large number of profiles can be rightly considered 
a complete set of polarization signals representative of the 
different Stokes V profiles that can be observed in the quiet 
Sun at disk center with 0.32 arcsec resolution and 10 -3 po- 
larimetric sensitivity. 

3. Data analysis 

The analysis consists of three main parts. The most im- 
portant one is the k-means classification of the profiles. 
This allows us to infer the typical profile shapes from the 
dataset described in § [2j Besides the classification, we de- 
rived the LOS velocity for each pixel in the FOV adopt- 
ing a Fourier transform method applied to both Stokes / 
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Fig. 1. Classes of Stokes V profiles of Fei 630.15 nm (left) and Fei 630.25 nm (right) in the quiet Sun according to a 
single k-means classification run. All the classes retrieved by the analysis are reported. Each panel contains the average 
(solid lines) and the standard deviation (colored error bars) of all profiles in the class. The dotted vertical line separates 
the two spectral lines (it divides the x-axis in two windows of 40 pixels, see § 13. ip . Each plot is represented in the same 
x — y range, i.e., the one of class 30. The number of the class is specified in each plot together with the percentages of 
profiles in the class. The colors group classes in families of alike profiles (§H]): network (purple), blue- lobe (blue), red- lobe 
(red), Q-like (sky-blue), asymmetric (green), antisymmetric (yellow), and fake (orange). 



lines separately (|Title fe Tarbelllll975l ). and the center-of- 
gravity magnetogra m signal from Stokes / and V (COG, 
iRees fc Semelfl979l) . 

The k-means classification algorithm is commonly used 
in dat a mining, machine learning , and artificial intelligence 
(e.g., lEverittf l995: Bishop 2006). I ndeed, the procedure we 
use is the same as the one used by ISanchez Almeida et al.l 
(|2010T) to analyze galaxy spectra from the Sloan Digital 
Survey Data Release 7. In this work the authors defined the 
procedure as automatic and unsupervised. "Unsupervised" 
implies that the algorithm does not have to be trained, 



whereas "automatic" means that it is self-contained, with 
minimal subjective influence. In the standard formulation, 
k-means begins by selecting at random from the full dataset 
of V spectra a number k of template spectra. Each template 
spectrum is assumed to be the center of a cluster, and each 
spectrum of the data set is assigned to the closest cluster 
center (i.e., that of minimum distance or, equivalently, clos- 
est in a least-squares sense). Once all spectra in the dataset 
were classified, the cluster center is re-computed as the av- 
erage of the spectra in the cluster. This procedure is iter- 
ated with the new cluster centers, and it finishes when no 
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Fig. 2. Histograms with the number of pixels in each class considering network regions (purple dot-dashed line) and IN 
regions (black solid line) separately. The arrows point out the network classes as derived from the statistics. 



spectrum is re-classified in two consecutive steps. The num- 
ber of clusters k is arbitrarily chosen but, in practice, the 
results are insensitive to this selection because only a few 
clusters possess a significant number of members, so that 
the rest can be discarded. On exit, the algorithm provides 
a number of clusters, their corresponding cluster centers, 
as well as the classification of all the original spectra now 
assigned to one of the clusters. The classes are ordered ac- 
cording to the total number of profiles they contain, i.e., 
assigning to the most abundant, 1 to the second most 
abundant, and so on. 

As a major drawback, k-means yields different clusters 
with each random initialization, therefore, one has to study 
this dependence by carrying out different random initializa- 
tions and comparing their results. Partly trying to allevi- 
ate this problem, the code we employ uses an initialization 
more refined than the plain random initialization. It con- 
sists of four steps: (Step 1) choose at random ten initial 
cluster centers; (Step 2) run one iteration of the standard 
k-means, and select as initial cluster center the cluster cen- 
ter with the largest number of elements; (Step 3) remove 
from the set of profiles to be classified those belonging to 
the cluster center thus selected; (Step 4) go to step 1 if pro- 
files are still left; otherwise end. In order to show that the 
method is working properly, in § [4] we present the results 
obtained from a single classification run and then validate 
them by showing that they fully agree with the average re- 
sults obtained when performing ten classification runs ini- 
tialized with different random conditions. 

An important point to be mentioned is that the classes 
provided by k-means do not form an orth ogonal basis in 
the p r incipal componen t analysis sense (e.g. lConnollv et"al"1 
119951: iRees et al.l l2000ft . The individual profiles are not 
unique linear superpositions of the cluster center profiles. 
Indeed, among the clusters found by the analysis, similar 
profiles can be easily recognized (Fig. [I]). However, it is im- 
portant to specify that the aim of our classification is not 
to provide orthogonal classes, but to point out the typi- 
cal shapes of the profiles observed by SOT/SP in the quiet 
Sun. The most important point to keep in mind is that all 
profiles associated to a given class are similar to the cluster 
profile representative of the class and, therefore, the cluster 
profiles are effectively representative of the shapes of the 
profiles observed by SOT/SP. Referring to the similarity 



between each cluster profile and the profiles associated to 
it, it is worth pointing out that profiles strongly deviating 
from the cluster center are discarded. In more detail, when 
the cluster centers are calculated as the average of the pro- 
files in the class, those members with a distance larger than 
three times the standard deviation are rejected. 

3.1. Definition of the input dataset archive 

Given a set of observations, the actual data to be classified 
can be chosen in different ways (e.g., range of wavelengths 
to be included, or whether the profiles are corrected for the 
magnetic polarity). Here we explain the way our dataset 
was defined and why. 

The main idea is to analyze exclusively the shape of 
Stokes V. For this reason we remove the global shift of the 
profiles, renormalize each profile to its maximum unsigned 
value, and sign the profiles according to the main polarity 
in the resolution element. 

Each profile is normalized to its maximum absolute am- 
plitude as 

V(A) — > V(A)/max(|V(A)|), (1) 

where A spans over the whole wavelength range of SOT/SP 
(i.e., 0.24 nm around 630.2 nm.). 

Global wavelength shifts are removed separately for the 
two spectral lines. We extract the Stokes V profiles of the 
two Fe I lines in two wavelength windows that are centered 
around the corresponding Stokes / line cores. The two wave- 
length windows have the same amplitude, i.e. 86 pm, so 
that both lines contribute to the classification with the same 
number of wavelength pixels. In this procedure the number 
of wavelength pixels is reduced from 112 to 80 (i.e., two 
windows of 40 wavelength pixels each). It is important to 
keep in mind that after this step it is not possible to con- 
sider a common wavelength range for the representation 
of Stokes V profiles of the two lines. For this reason, the 
abscissas of our Stokes V profiles are given in pixel units 
rather than wavelengths (see Fig. [1]) . 

Finally, we individually sign the profiles according to the 
polarity that dominates Stokes V, so that if V were per- 
fectly antisymmetric, the polarity-corrected Stokes V would 
have a positive blue lobe and a negative red lobe. Since the 
signals are not antisymmetric, we define dominant polarity 
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Fig. 3. Spatial distribution of the profile families. Each color is representative of a single family according to the color 
code in Fig. [TJ White pixels are the ones with max(|l/|) < 4.5 x ay, which are excluded from the classification analysis. 
Red contours highlight the network regions in the FOV. The inset contains a 15 x 15 Mm 2 detail of the FOV (black 
square) for a better visualization of the small family patches around a network region. The black contours highlight the 
regions with continuum intensity above the average continuum intensity of the FOV, i.e., the granules. 
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as that yielding a V with the sign of the largest average 
signal in the profile. In practice, we average Stokes V sep- 
arately in the red-wing and the blue-wing to decide which 
wing dominates. The division between red and blue is set by 
the Stokes I line core wavelengths. If the (unsigned) blue- 
wing signal is largest, then we divide Stokes V by its sign. 
If the (unsigned) red-wing signal is largest, then we divide 
Stokes V by minus its sign. The result is a V profile with 
the largest signal positive if it happens in the blue-wing, 
and negative if happens in the red-wing (see the profiles in 
Fig. Q] for illustration) . 

4. Results 

The main result of the k-means classification is reported in 
Fig. [TJ It shows the cluster centers for all classes of profiles 
existing in the quiet Sun. After several trial classifications, 
we set the maximum number of allowed classes to 50, which 
exceeds the typical number of classes retrieved by the pro- 
cedure (i.e., around 35, see Tabled]). In this way we are 
sure that the classification procedure is automatically set- 
ting the number of classes. The classification performed on 
the dataset of 535465 profiles took about 20 min on a stan- 
dard laptop. 

We number the classes according to the percentage of 
observed profiles belonging to the class, from the most com- 
mon, class 0, to the least common, class 34. It is impor- 
tant to notice that strongly asymmetric profiles are found 
throughout, e.g., the second most populated class shows im- 
portant asymmetries. Classes of Stokes V with three lobes 
(Q-like profiles) are also common, meaning that profiles like 
this are not isolated cases in quiet Sun measurements. 

As already reported in § [31 similar profile shapes can 
be recognized among the cluster profiles in Fig. Q] Indeed, 
attending to their basic properties the profiles can be 
grouped in six families: network, blue-lobe, red-lobe, Q-like, 
asym metric, and antisymmetric (partially following [Steiner] 
2000). With basic properties we refer to the major features 
of profiles, e.g., blue- lobe profiles have a very small red lobe 
so that, independently of the amount of polarization in the 
blue lobe, they are dominated by the blue lobe itself. 

The first family, i.e., the network one, can be defined by 
calculating the statistics for the k-means classes when net- 
work and IN regions are considered separately, i.e., for the 
pixels included/not-included in the red contours of Fig. |3j 
The contours separate network and IN regions in the FOV. 
N etwork patches were i dentified with the algorithm used 
in lViticchie et al.l (|201lh . We refer to this work for details, 
but it basically selects the patches with the largest polar- 
ization signals that all together cover some 10 % of the 
FOV 0. The statistics is reported in Fig. [2 here the net- 
work classes are pointed out by arrows. More in detail, the 
network-IN division performed by the routine has an equiv- 
alent division in the k-means classes. Indeed, the classes 
that are abundant in the network (i.e., those pointed out 
by the arrows) are scarce in the IN, and vice-vers eB These 
results imply that the network is associated with particu- 
lar Stokes V shapes that are not found elsewhere. In other 

2 The network-IN separation is somewhat arbitrary. Another 
definition would make the network patches larger, including 
classes with larger asymmetries (see, e.g., the inset in Fig. [2j). 

3 One class is found to be shared by network and IN regions 
(i.e., class 7), we decided to consider it as a network class. 
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Fig. 4. Statistics of the LOS velocities (vi os ) of the pix- 
els associated with each family of Stokes V shapes. Each 
plot contains the histograms of both Fei 630.15 nm (thick 
lines) and Fei 630.25 nm (thin lines). The histograms are 
normalized to the total number of profiles in the family. 
The vertical dotted lines separate the upfiow regime (left) 
and the downflow regime (right). 

words, a classical network identification and the k-means 
classification point out the same regions with two different 
methods: the first one is mainly based on the analysis of 
the spatial coherence of strong Stokes V signals, whereas 
the second one does not take into account the amplitude 
of Stokes V (profiles are normalized before classification, 
see § 13. ip . and refers just to profile shapes. The "network 
family" contains the following group of k-means classes: 
classes 0, 3, 4, 6, 7, 12, and 19 (the cluster profiles of 
these classes are reported in Fig. [T]with purple error bars). 
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These are representative of about 28.6 % of the total num- 
ber of analyzed pixels. By examining the shapes of the clus- 
ters in the network family one can notice that these profiles 
present the blue lobe larger than the red one, while the lat- 
ter is more extended in wavelength (its area is larger than 
the area of the blue lobe) . Such a sha pe is well known to 
be associated to network reg ions (e.g.. IStenflo et al1ll984t 
ISanchez Almeida et al.lll996l ). Moreover, the amplitude of 
the Stokes V profiles in the two lines is very similar in 
all network classes; this highlights the fact that a satura- 
tion due to kG fi elds is present, as expected in network re- 
gions (see, e .g., ISocas-Navarro fc Sanchez Almeidal l2002t 
IStenflol feOlOl h As explained above, each profile emerging 
from HINODE pixels with sufficient signal is associated to 
a class and, by grouping of classes through a certain cri- 
teria, to a family. In Fig. [3J we identify the position of a 
network profile with a purple pixel. The map shows how 
the network contours highlight large purple patches. 

On the other hand, clusters profiles in IN pixels present 
a huge variety of asymmetries. As an example, class 1, 
the most abundant class in the IN, is represented by a clus- 
ter profile dominated by the blue lobe. Many other classes 
are represented by profiles dominated by a single lobe, ei- 
ther the blue one or the red one (here always negative be- 
cause of the condition imposed in § 13. The profiles in 
these classes represent approximately 34.4 % of the ana- 
lyzed pixels. Among these, 17.5 % are the profiles domi- 
nated by the blue lobe, named the "blue- lobe family", i.e., 
class 1, 5, 10, 15, and 28 (represented with blue error bars 
in Fig. [1] and blue pixels in Fig. [3]) and 9.6 % by the red 
lobe, named the "red-lobe family", i.e., 17, 21, 23, and 24 
(represented with red error bars in Fig. [1] and red pixels in 
Fig. [3]). Moreover, a third family of extremely asymmetric 
profiles can be defined; class 25, 26, 30, 31, 33, and 34 
(represented with sky-blue error bars in Fig. [T] and sky-blue 
pixels in Fig. [3J form the "Q-like family" , which contains 
7.3 % of the total number of analyzed pixels. 

The classes still not considered are IN profiles presenting 
shapes that are not clearly among the three families defined 
above. In spite of this, many of these profiles still present 
evident asymmetries; the "asymmetric family" is formed by 
the class 2, 8, 11, 13, 14, 16, 18, 22, 27, and 32 which are 
29.6 % of the analyzed profiles (represented with green error 
bars in Fig. [1] and green pixels in Fig. [3]). Finally, only two 
classes show almost antisymmetric profiles; these represent 
only 6.3 % of the dataset (i.e., class 9 and 20) and are 
named the "antisymmetric family" (represented with yellow 
error bars in Fig. [T]and yellow pixels in Fig. [3]). 

Another interesting result comes from the study of the 
position of the families on the FOV. This can be done, for 
example, by studying the LOS velocity (vi os ) for the pixels 
associated to each family. Pixels with vi os < (vi os > 0) 
are placed in granules (intergranular lanes), i.e., upflow re- 
gions (downflow regions). Fig. 2] contains the result of such 
a study when calculating the statistics of vi os (as derived by 
Stokes /, see § [3J for the pixels included in the six families 
defined above. The main result derived from this study is 
that each family is related to a certain dynamics. This is 
very revealing because the classification procedure does not 
know anything about vi os because wavelength shifts were 
removed when extracting Stokes V profiles in the wave- 
length windows centered around Stokes / cores (§ 13. 
From this one learns that the plasma dynamics, as derived 
from Stokes /, i.e., the dynamics mostly associated to the 



Table 1. single run results vs. average results 



single run (Fig. [1| ten runs 


classes 


35 


35 ±3 


network 


28.6 % 


(28 ± 3) % 


blue-lobe 


17.6 % 


(15 ± 2) % 


red-lobe 


9.6 % 


(11 ± 2) % 


Q-like 


7.2 % 


(8 ± 2) % 


asymmetric 


29.6 % 


(30 ± 5) % 


antisymmetric 


6.3 % 


(7 ± 4) % 



non magnetized plasma in the pixel, is in some way related 
the shape of Stokes V profiles. One can elaborate a bit more 
on this result. The separation between the families with one 
lobe only (i.e., the blue-lobe and the red- lobe) is evident. 
On the one hand, profiles dominated by the blue lobe are 
mostly found in upflow regions, i.e., on top of the granules. 
On the other hand, profiles dominated by the red lobe are 
mostly found in downflow regions, i.e., intergranular lanes. 
The Q-like, network, and asymmetric families are found in 
downflow regions. Antisymmetric profiles are found in both 
upflow and downflow pixels with almost the same proba- 
bility. Overall, as expected, most of the polarization signals 
reside in downflows, i.e., 68.3 % of the analyzed pixels. 

Different families are also associated with different spa- 
tial extents. When calculating the area of the largest patch 
Q in the FOV formed, exclusively, by profiles of the particu- 
lar family one finds that network patches, mainly associated 
with class 3 and 4, can reach dimensions up to few tens 
arcsec 2 . All other families have typical patch dimensions 
of few tenths of 1 arcsec 2 (see for a comparison the detail 
image in Fig. |3jin which contour highlighting the granular 
scale are reported). This means that out of network regions 
the profile shape of the analyzed profiles change over very 
small scales, smaller than the typical granular scale. 

The results presented in this section were derived from a 
single k-means classification run. As specified in §[3]it is im- 
portant to study the dependence of the results on different 
random initializations. To do this, we derived the average 
abundances of the different families from ten classification 
runs; Table [Hallows one to compare the average results with 
those presented above. From this we can conclude that the 
results presented are sound and that the classification does 
not depend on the random initialization. 

The only class we did not consider so far in the presenta- 
tion of the results is class 29, representative of 1.1 % of the 
analyzed profiles. This can be considered a fake class since it 
contains anomalous profiles. Here we define as anomalous, 
for example, those profiles on which the procedure adopted 
for the definition of the input dataset (§ I3.1[) did not work 
properly. A single fake class is found in all the classification 
runs; this is always representative of about 1 % of the ana- 
lyzed profiles. This can be verified also in Table [TJ indeed by 
adding up the percentages from all families one gets 99 % 
of the analyzed profiles. 

5. Discussion 

An important point to emphasize is that the number of 
k-means classes representative of all the profiles observed 

4 Here "patch" means a group of connected pixels belonging 
to the same family. 
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by SOT/SP (35 ±3) is small compared to the total number 
of profiles in the analyzed dataset (535465). Moreover, when 
the major features of the classes are considered, the ana- 
lyzed profiles can be organized in six families (i.e., network, 
blue-lobe, red-lobe, Q-like, asymmetric, and antisymmet- 
ric) so that a few shape typologies can represent SOT/SP 
quiet Sun data. Nevertheless, it is important to note that 
almost all these shapes are strongly asymmetric. 

Most SO T/SP data have been so far int e rpreted through 
ME codes (jOrozco Suarez et al.l l2007at lAsensio Ramosl 
2009), but the k-means classes in Fig. [T] show that such 
an hypothesis has to be replaced to analyze the whole va- 
riety of profile shapes observed by SOT/SP in the quiet 
Sun. Except for the classes whose profile shapes can be rea- 
sonably considered to be quasi-antisymmetric (see § [3]) , we 
can conclude that ME codes ca nnot reproduce the observed 
profiles observed by SOT/SP. lOrozco Suarez et al.l (|2010[) 
recently showed that ME codes can be used to derive av- 
erage properties of fully resolved magnetized atmospheres 
(technically speaking with magnetic filling factors equal to 
1) even when inverting extremely asymmetric profiles. This 
good news does not eliminate the need for more refined in- 
versions. The asymmetries provide information on the un- 
resolved structure and/or gradients along the LOS of the 
magnetic field that are washed out unless the line shapes 
are adequately interpreted. Moreover, the present angular 
resolutions do not grant treating all IN magnetic features 
as fully resolved. 

These considerations can be used in a constructive way 
when thinking about future strategies for the interpreta- 
tion of spectropolarimetric data either from HINODE or 
from future telescopes. One needs different hypotheses in 
order to interpret (and fit) the different polarization pro- 
files emerging from the quiet Sun. As an example, let us 
consider two extreme cases from our k-means classifica- 
tion, e.g., class 9 and class 25. The first one, quasi- 
antisymmetric, can be interpreted adopting ME hypothe- 
ses. The second one is a class that, for example, could be 
interpreted in terms of mixed polarities in a single pixel 
of HINODE, i.e., with a two magneti c comp onent model 
atmosphere. Recently, lAsensio Ramosl (|2010h put forward 
an inversion procedure based on Bayesian techniques. This 
procedure follows dire ctly from the approach adopted in 
lAsensio Ramosl (|2009l ). based exclusively on ME hypoth- 
esis. The new code is able to automatically recognize the 
simplest model, from those provided by the user that can re- 
produce the examined polarization profiles. In other words, 
the code recognizes the minimum degree of complexity of 
a model atmosphere needed to reproduce the observations. 
This means that different inversion hypotheses can be used 
in a single inversion procedure. From our point of view such 
an approach is extremely promising because it allows to 
avoid an a-priori choice of hypotheses for the description of 
the solar photosphere that, in some cases, can turn out to 
be wrong in reproducing observations. 

Among the profile shapes in Fig. [T] many single lobe 
profiles can be re cognized . Nam ely, according to the defi- 
nition adopted in ISteinerl (|2000h . 17.6 % of the analyzed 
profiles are blue-lobe-only, while 9.6 % are red-lobe-only. 
Such extremely asy mmetric profiles have been the object 
of recent studies (iGrossmann- Doerth et aD Il988l Il989t 



Sanchez Almeida et alJ 119961: iGrossmann-Doerth et al.l 
2000t ISteinerl l2000f) . In IGrossmann-Doerth et alJ (|2000D . 
the authors pointed out two models to produce such 



profiles: the canopy model and the embedded magnetic 
flux tube model. Both models are characterized by a 
discontinuity along the LOS between the field-free com- 
ponent and the magnetized one. The main conclusion of 
this work is that both models are able to reproduce the 
observed asymmetries. More in detail, the canopy model 
needs an appropriate temperature stratification to produce 
strong asymmetries while the embedded magnetic flux 
tube model is able to produce them with a standard quiet 
Sun temperature stratification. 

The importance of the thermal properties of the solar 
atmosphere in the forma tion of s trong ly asymmetric pro- 
files is also highlighted in ISteinerl (|2000f ) as one of the main 
conclusions of the paper in which a canopy configuration 
under ME hypotheses is considered. From this study it 
follows that a temperature inversion, causing an emission 
in the magnetized layer, is needed at the magnetopause 
of the canopy to produ ce blue-lobe-onl y profi les in upflow 
regions as observed in ISigwarth et al.l (|l999l ) and in our 
analysis. Besides this case, also Q-like profiles can be pro- 
du ced thanks to e mission processes in the magnetized layer. 
In[St einerl (|2000l ) the author reported a complete atlas of 
asym metric profiles p roduced with a canopy configuration. 

In ISteinerl ((2000) Stokes V profiles emerging from a 
micro-loop smaller than the resolution element of the ob- 
servations are also discussed. With such a figure one can 
explain the formation of both blue-lobe-only and red-lobe- 
only profiles and the correlation of these two families with 
the photospheric dynamics. The micro-loop model impli- 
cates the presence of a loop in the resolution element, 
i.e., two opposite polarities corresponding to the legs (foot- 
points) of the loop. Considering this figure, Q-like profiles 
can be easily produced, moreover, it can be shown that a 
suppression of the blue (red) lobe in the emerging profiles 
in correspondence of downflows (upflows) can be obtained. 
Similarly to what was found for the canopy model, one 
can imagine the definition of an atlas of asymmetric pro- 
files from the micro-loop model. Any modification of the 
internal dynamics of the micro loop can influence the lobe 
cancellation leading either to a Q-like profile or to a profile 
partially dominated by a blue/red lobe as those we found 
in the classification. 

From our discussion it follows that adopting either a 
model with a magnetopause along the LOS or a micro-loop 
model one could, in principle, reproduce a large part of the 
profiles observed in the quiet Sun by SOT/SP. 

We have to poi nt out that the micro-loop model 
was introduced by ISteinerl (|2000T ) for the interpretation 
of 1 arcsec angular resolut i on sp ectropolarimetric data 
presented in ISigwarth et al.l ([l999) . This is an important 
point because the micro-loop is supposed to be contained 
in the resolution element of the observation. SOT/SP 
angular resolution is 0.3 arcsec, i.e., thre e times smaller 
than that of the dataset analyzed in ISigwarth et al.1 
(|1999[ ). but still many strong asymmetries compatible with 
the micro-loop model emerge in quiet Sun polarimctric 
measurements. The possible presence of micro-loops in the 
HINODE resolution element is of big interest for the solar 
community. Indeed, such a figure is in close connection with 
many recent works that have been dedicated to the study 
of loop emergen ce events in the quiet Sun as detected by 
HINO DE ( e.g.. ICenteno et all 120071: lOrozco Suarez et all 



20081: iMartmez Gonzalez fc Bellot Rubiol 
Martinez Gonzalez et al.l l2010fl 



20091 



As an example, in 
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lOrozco Suarez et al.1 



Stokes V profiles observed 



in correspondence with a loop emergence event are re- 
ported; at the beginning of the process a single blue-lobe 
profile is found. In the same work the authors presented 
preliminary inversions of single lobe profiles performed 
through a model atmosphere characterized by a strong 
dis continuity in the stratification (as in a canopy model 
of iGrossmann-Doerth et al. l l2000f ). This analysis and that 
by (Louis et al.l (|2009ft are based on SIRJUMP, a modified 
version of SIRGAUSS, which is able to deal with discon- 
tinuities along the LOS. In-depth analyses of single-lobe 
profiles o bserved by HINOD E via the SIRJUMP code are 
ongoing (jSainz Daldall2010l h 

A last possible scenario for reproducing the variety 
of Stokes V asymmetries observed in the quiet Sun is 
the MISMA scenario, i.e., a Micro- Structu red Magnetized 
Atmosphere (San chez Almeida et al.lll996f ). In this model 
the solar atmosphere is thought to be composed by 
many optically thin magnetized components with differ- 
ent plasma dynamics embedded in a field-free gas. These 
components naturally arise from the in terplay between 
convection and magne t ic fie lds (e.g., ICattaneol 19991: 
iPietarila Graham et all I2009D . [G rossmann-Doerth et al.l 
( 2000f ) correctly considered MISMAs as a generalization 
of the model they proposed when considering many tran- 
sients along the LOS. Here we point out that in the MISMA 
formula tion also mixed polarity (loop-l ike) models are re- 
quired (jSanchez Almeida fc Litesll2000t ). Namely, they are 
needed to explain the Q-like profiles and many of the one- 
lobe profiles. The MISMA hypothesis has been recently 
adopted to provide the first interpre tation of the asymme - 
tries of HINODE quiet Sun profiles (| Viticchie et alj|2011ft . 
One of the main results presented in this work is the plen- 
itude of mixed polarity pixels, i.e., 25 % of the IN pro- 
files. This value is three times the abundance of Q-like pro- 
files found here. However, mixed polarity profiles could be 
present in principle in many other classes that present ex- 
treme asymmetries. As an example, class 1 (5.2 %) could 
contain mixed polarity pixels. 

From the above discussion it follows that different 
models/hypotheses can be used for the interpretation of 
the strongly asymmetric polarization profiles observed by 
SOT/SP and, usually, different models are associated to dif- 
ferent inyereic^i_£ode^Am£ng these, the MISMA inversion 
code (jSanchez Almeidalll997t) is the only one so far used 
for the interpretation of asymmetries in SOT/SP data, and 
it turns out to be suitable for the i nterpretation of all th e 
profile shapes highlighted in Fig. [T] (jViticchie et al.l [20111) . 
From our point of view, different and independent interpre- 
tations from different analysis methods can be a key factor 
to clarify the physical picture behind the asymmetries re- 
ce ntly measured in the q uiet Sun. The approach adopted 
bv lAsensio Ramosl (|2010h clearly goes in this direction. 

By comparing the results of our classification with pre- 
vious studies on the shapes of Stokes V profiles in the 
quiet Sun we note that the amount of strongly asymmet- 
ric profiles seems to increase with the spatial res o lution 
of o bservations. Here we refer to Sigwart het al.l (|1999t) 
and ISanchez Almeida fc Litesl (|200C ) for a comparison. In 
both papers, spectropolarimetric observations were per- 
formed through the Advanced Stokes Polarimeter (ASP) 
with a spectral sampling of approximately 1.2 pm pixel -1 , 
a polarimetric sensitivity on the order of 10 -4 , and an 
angular resolution of about 1 arcsec. From their respec- 



tive analyses the authors obtained similar results on the 
abundance of strongly a s ymme tric profiles. Referring to 
ISanchez Almeida fc Litea ((20001 Fig. 4) one concludes that 
roughly 17 % of the analyzed prof iles considerabl y devi- 
ate from antisymmetric shapes; in Sig warth et al.l (|1999[) 
the percentage is of about 10 % in the quiet Sun. These 
abundances are at least a half t he abundance we obta ined 
(around 35 %, in agreement with iViticchie et al.ll2011l who 
analyzed the same dataset). We note that both the spec- 
tral sampling and the polarimetric sensitivity provided by 
ASP are suitable to go into the details of Stokes V shapes. 
Moreover, the methods adopted by the authors to study the 
Stokes V shapes can be regarded as reliable. In spite of this, 
from SOT/SP measurements with 0.3 arcsec angular resolu- 
tion and 10 -3 polarimetric sensitivity we find an increase in 
the abundance of strongly asymmetric profiles. This result 
is not surprising if we refer to the typical dimension of the 
patches associated to the IN families of our classification, 
i.e., few tenths of 1 arcsec. From this it follows that strong 
asymmetries are usually found o ver small scales . This fully 
agrees with the conclusions of ISigwarth et al.l (|1999l ) , in 
which the authors report that "The broad scatter in the 
V -parameters for the weakest V signals can be interpreted 
as a dramatic increase in the dynamic behaviour with de- 
creasing size (and fill factor) of magnetic elements". It is 
very important to remember that most of the IN pixels were 
overlooked in our analysis because of the selection criteria 
on the Stokes V amplitude. This could in principle strongly 
affect the dimension of the patches associated to each IN 
family. Indeed, by considering Stokes signals close to the 
SOT/SP noise level would considerably increase the num- 
ber of asymmetric profiles and affect the properties of IN 
family patches. 

Another result in agreement with lSigwarth et al.l (|l999f) 
is the difference in the dynamical properties of the blue- 
lobe family and red-lobe family, respectively. With respect 
to this, one has to remember that the dynamical proper- 
ties in Fig. 2] were derived exclusively from Stokes I profiles 
so that it is probably too optimistic to state that different 
dynamical properties of the non magnetized plasma can 
give rise to different Stokes V shapes. Rather, we can think 
about different field-free dynamics associated to certain at- 
mospheric configurations that are typical either of granules 
or intergranular lanes (as discussed above). 

As a final remark, we now discuss two possible appli- 
cations of the set of k-means profiles reported in Fig. [TJ 
The first one is the definition of a set of profiles that can 
be used to check whether the profiles synthesized from 
MHD simulations can be confidently considered to be rep- 
resentative of the profiles obse r ved b y HINODE. As an 
example, in IVogler fc Schiisslerl (|2007D MURaM local dy- 
namo simulation magnetic fields turn out to be extremely 
variable in polarity over very small scales. This implies 
the presence of mixed polarities over scales comparable to 
HINODE resolution, possibly producing asymmetric pro- 
files like those in SOT/SP observations. This is an im- 
portant and easy check that has not been considered so 
far when analyzing Stokes profiles produced from MURaM 
simulations. Such a check is very important because many 
MURaM snapshots have been used to test inversion codes 
employed fo r extensive analyses of HIN ODE data. As an 
example, in lOrozco Suarez et"aT1 (|2007bt ) the authors con- 
sidere d three MURaM snapshots to test their M E inversion 
code (jOrozco Suarez fc Del Toro Iniestal l2007f) . The three 
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snapshots were selected to be unipolar and with different 
values of the unsigned flux, namely, fO G, 50 G, and 200 G, 
so to take into account different magnetic regimes observ- 
able in the quiet Sun. To reproduce HINODE observations 
the finite spatial and spectral resolutions of SOT/SP were 
considered through a degradation of the synthesized data. 
The 10~ 3 polarimetric sensitivity was also taken into ac- 
count to define the noise level of the synthesized observa- 
tions. The strategy adopted by the authors can be regarded 
to be complete and rigorous, but at the same time a ques- 
tion is still open. Are the profiles produced b y the snap- 
shots selected by lOrozco Suarez et ail (|2007bl ) an exhaus- 
tive set of profiles for the description of HINODE mea- 
surements? One could answe r the question by comparing 
the synthetic profiles used bv lOrozco Suarez et al.l (|2007b[ ) 
with those portrayed in Fig. [TJ If they agree, then it would 
have two important consequences. First, the ME code has 
been tested on a complete set of profiles which is repre- 
sentative of real observations so, as a direct consequence, 
the test guarantees that an analysis of SOT/SP data per- 
formed through a ME code is reliable. The second one is 
that MURaM snapshots are able to reproduce the physics 
producing HINODE SOT/SP measurements. 



6. Conclusion 

We present the results of the k-means classification of 
HINODE SOT/SP Stokes V profiles of Fei 630.15 nm and 
Fei 630.25 nm observed in the quiet Sun. The classification 
procedure is automatic and unsupervised, and it is able to 
highlight the typical circular polarization measurements re- 
trieved by SOT/SP, and to organize them in classes. Here 
we analyze the profiles from a 302 x 162 arcsec 2 portion of 
quiet photosphere. This dataset, consisting of 535465 pro- 
files, can be regarded as a complete sample of polarization 
measurements performed at 0.3 arcsec angular resolution 
and 10~ 3 polarimetric sensitivity in the quiet Sun. 

A large variety of profile shapes emerges from the quiet 
Sun. These shapes can be typically classified in 35 k-means 
classes (Fig. Q]). Referring to the major features of the 
k-means classes, we were able to organize them in six fami- 
lies, namely, network, blue-lobe, red-lobe, Q-like, asymmet- 
ric, and antisymmetric. 

Network and IN profile classes are well separated in 
shape. Network profiles represent approximately 28 % of 
the analyzed profiles. 

Asymmetric profiles are very common in the quiet Sun, 
namely, about 93 % of the analyzed quiet Sun profiles. 
Strongly asymmetric profiles are found in the IN. They 
can be organized in three families: i) blue- lobe, ii) red- 
lobe, and Hi) Q-like profiles (similarly to the definition in 
St einerl l^OOO). These represent about 34 % of the analyzed 
profiles. Each family of profiles is associated to a certain 
photosphcric dynamics as inferred from the line-core shift 
of the intensity profiles. Blue-lobe profiles are in upflow re- 
gions, red- lobe ones are in downfiows, and Q-like profiles 
are in downfiows; these dynamical properties are probably 
related to certain magnetic configurations preferentially as- 
sociated with granules or with intergranular lanes. 

The classes associated to the network are found to be 
spatially coherent over tens of arcsec 2 . IN classes are found 
to be coherent over small scales (i.e., < 1 acrsec 2 ). From 
this we conclude that the processes producing strong asym- 



metries in Stokes V profiles take place over small scales, 
comparable to SOT/SP angular resolution. 

We put forward a possible application of the set of 
k-means cluster profiles (Fig. [1]). They contain all the 
Stokes V shapes observed in the quiet Sun at 0.3 arcsec 
resolution and with maximum absolute amplitude larger 
than 4.5 x 10~ 3 , therefore, they can be used as a standard 
reference to test inversion codes and to check the validity 
and completeness of modern MHD simulations. 

From the abundance of asymmetric profiles we con- 
clude that ME inversions cannot be considered to be ex- 
haustive for the description of the properties of the quiet 
Sun photosphere. Indeed, Stokes V profiles observed by 
HINODE SOT/SP in the quiet Sun contain unique infor- 
mation about the stratification of the solar photosphere 
th at so far have b e en ove rlooked in ME analyses. The work 
of I Viticchie et all (|2011l ) is the first attempt to try to ex- 
tract this information. However, as shown in § [51 several 
alternative interpretations are possible through hypotheses 
on the underlying atmospheres. These alternatives should 
be sought to outline the spectrum of physical scenarios for 
the description of the quiet Sun polarization measurements. 
Those features common to all inversions will be regarded 
as robust results, whereas inconsistencies must be studied 
and cleared out. 
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